3 <110> APPLICANT: Assistance Publique - Hopitaux de Paris (AH-HP)

4 Institut National de la Sante et de la Recherche Medicale 

5 (INSERM) 

6 Institut Gustave Roussy (IGR) 

7 Universite de Versailles - Saint-Quentin-en-Yvelines 

8 Universite Paris-Sud 

9 VAINCHENKER, William 

10 UGO, Valerie 

11 JAMES, Chloe

12 LE COUEDIC, Jean-Pierre 

13 CASADEVALL, Nicole 

15 <120> TITLE OF INVENTION: Identification of a JAK2 mutation involved in Vaquez 

16 Polyglobulia 

18 <130> FILE REFERENCE: D 22707 
C--> 20 <140> CURRENT APPLICATION NUMBER: US/10/580,458A
C--> 21 <141> CURRENT FILING DATE: 2006-05-24 

23 <160> NUMBER OF SEQ ID NOS: 31

25 <170> SOFTWARE: PatentIn version 3.3

27 <210> SEQ ID NO: 1 

28 <211> LENGTH: 1132 

29 <212> TYPE: PRT

30 <213> ORGANISM: homo sapiens 

33 <220> FEATURE: 

34 <223> OTHER INFORMATION: variant JAK2 V617F 

36 <400> SEQUENCE: 1
390 395 400

113 His Gly Pro Ile Ser Met Asp Phe Ala Ile Ser Lys Leu Lys Lys Ala

114 405 410 415

116 Gly Asn Gln Thr Gly Leu Tyr Val Leu Arg Cys Ser Pro Lys Asp Phe

117 420 425 430

119 Asn Lys Tyr Phe Leu Thr Phe Ala Val Glu Arg Glu Asn Val Ile Glu

120 435 440 445

122 Tyr Lys His Cys Leu Ile Thr Lys Asn Glu Asn Glu Glu Tyr Asn Leu

123 450
138 530 535 540



140 Leu Ile Phe Asn Glu Ser Leu Gly Gln Gly Thr Phe Thr Lys Ile Phe

141 545 550 555 560

143 Lys Gly Val Arg Arg Glu Val Gly Asp Tyr Gly Gln Leu His Glu Thr

144 565 570 575

146 Glu Val Leu Leu Lys Val Leu Asp Lys Ala His Arg Asn Tyr Ser Glu

147 580 585 590

149 Ser Phe Phe Glu Ala Ala Ser Met Met Ser Lys Leu Ser His Lys His

150 595 600 605

152 Leu Val Leu Asn Tyr Gly Val Cys Phe Cys Gly Asp Glu Asn Ile Leu

153 610 615 620

155 Val Gln Glu Phe Val Lys Phe Gly Ser Leu Asp Thr Tyr Leu Lys Lys

156 625 630 635 640

158 Asn Lys Asn Cys Ile Asn Ile Leu Trp Lys Leu Glu Val Ala Lys Gln

159 645 650 655

161 Leu Ala Trp Ala Met His Phe Leu Glu Glu Asn Thr Leu Ile His Gly

162 660 665 670

164 Asn Val Cys Ala Lys Asn Ile Leu Leu Ile Arg Glu Glu Asp Arg Lys

165 675 680 685

167 Thr Gly Asn Pro Pro Phe Ile Lys Leu Ser Asp Pro Gly Ile Ser Ile

168 690 695 700

170 Thr Val Leu Pro Lys Asp Ile Leu Gln Glu Arg Ile Pro Trp Val Pro

171 705 710 715 720

173 Pro Glu Cys Ile Glu Asn Pro Lys Asn Leu Asn Leu Ala Thr Asp Lys

174 725 730 735

176 Trp Ser Phe Gly Thr Thr Leu Trp Glu Ile Cys Ser Gly Gly Asp Lys

177 740 745 750

179 Pro Leu Ser Ala Leu Asp Ser Gln Arg Lys Leu Gln Phe Tyr Glu Asp

180 755 760 765

182 Arg His Gln Leu Pro Ala Pro Lys Trp Ala Glu Leu Ala Asn Leu Ile

183 770 775 780

185 Asn Asn Cys Met Asp Tyr Glu Pro Asp Phe Arg Pro Ser Phe Arg Ala

186 785 790 795 800

188 Ile Ile Arg Asp Leu Asn Ser Leu Phe Thr Pro Asp Tyr Glu Leu Leu

189 805 810 815

191 Thr Glu Asn Asp Met Leu Pro Asn Met Arg Ile Gly Ala Leu Gly Phe

192 820 825 830

194 Ser Gly Ala Phe Glu Asp Arg Asp Pro Thr Gln Phe Glu Glu Arg His

195 835 840 845

197 Leu Lys Phe Leu Gln Gln Leu Gly Lys Gly Asn Phe Gly Ser Val Glu

198 850 855 860

200 Met Cys Arg Tyr Asp Pro Leu Gln Asp Asn Thr Gly Glu Val Val Ala

201 865 870 875 880

203 Val Lys Lys Leu Gln His Ser Thr Glu Glu His Leu Arg Asp Phe Glu

204 885 890 895

206 Arg Glu Ile Glu Ile Leu Lys Ser Leu Gln His Asp Asn Ile Val Lys

207 900 905 910

209 Tyr Lys Gly Val Cys Tyr Ser Ala Gly Arg Arg Asn Leu Lys Leu Ile

210 915 920 925

212 Met Glu Tyr Leu Pro Tyr Gly Ser Leu Arg Asp Tyr Leu Gln Lys His

213 930 935 940

215 Lys Glu Arg Ile Asp His Ile Lys Leu Leu Gln Tyr Thr Ser Gln Ile

216 945 950 955 960

218 Cys Lys Gly Met Glu Tyr Leu Gly Thr Lys Arg Tyr Ile His Arg Asp

219 965 970 975

221 Leu Ala Thr Arg Asn Ile Leu Val Glu Asn Glu Asn Arg Val Lys Ile

222 980 985 990

224 Gly Asp Phe Gly Leu Thr Lys Val Leu Pro Gln Asp Lys Glu Tyr Tyr

225 995 1000 1005

227 Lys Val Lys Glu Pro Gly Glu Ser Pro Ile Phe Trp Tyr Ala Pro

228 1010 1015 1020

230 Glu Ser Leu Thr Glu Ser Lys Phe Ser Val Ala Ser Asp Val Trp

231 1025 1030 1035

233 Ser Phe Gly Val Val Leu Tyr Glu Leu Phe Thr Tyr Ile Glu Lys

234 1040 1045 1050

236 Ser Lys Ser Pro Pro Ala Glu Phe Met Arg Met Ile Gly Asn Asp

237 1055 1060 1065

239 Lys Gln Gly Gln Met Ile Val Phe His Leu Ile Glu Leu Leu Lys

240 1070 1075 1080

242 Asn Asn Gly Arg Leu Pro Arg Pro Asp Gly Cys Pro Asp Glu Ile

243 1085 1090 1095

245 Tyr Met Ile Met Thr Glu Cys Trp Asn Asn Asn Val Asn Gln Arg

246 1100 1105 1110

248 Pro Ser Phe Arg Asp Leu Ala Leu Arg Val Asp Gln Ile Arg Asp

249 1115 1120 1125

251 Asn Met Ala Gly

252 1130

254 <210> SEQ ID NO: 2 

255 <211> LENGTH: 5097 

256 <212> TYPE: DNA 

257 <213> ORGANISM: homo sapiens 

260 <220> FEATURE: 

261 <223> OTHER INFORMATION: G1849T mutation in jak2 gene 

263 <400> SEQUENCE: 2 

264 ctgcaggaag gagagaggaa gaggagcaga agggggcagc agcggacgcc gctaacggcc 60 
266 tccctcggcg ctgacaggct gggccggcgc ccggctcgct tgggtgttcg cgtcgccact 120 
268 tcggcttctc ggccggtcgg gcccctcggc ccgggcttgc ggcgcgcgtc ggggctgagg 180 
270 gctgctgcgg cgcagggaga ggcctggtcc tcgctgccga gggatgtgag tgggagctga 240 
272 gcccacactg gagggccccc gagggcccag cctggaggtc gttcagagcc gtgcccgccc 300
274 cggggcttcg cagaccttga cccgccgggt aggagccgcc cctgcgggct cgagggcgcg 360 
276 ctctggtcgc ccgatctgtg tagccggttt cagaagcagg caacaggaac aagatgtgaa 420 
278 ctgtttctct tctgcagaaa aagaggctct tcctcctcct cccgcgacgg caaatgttct 480 
280 gaaaaagact ctgcatggga atggcctgcc ttacgatgac agaaatggag ggaacatcca 540 
282 cctcttctat atatcagaat ggtgatattt ctggaaatgc caattctatg aagcaaatag 600 
284 atccagttct tcaggtgtat ctttaccatt cccttgggaa atctgaggca gattatctga 660 
286 cctttccatc tggggagtat gttgcagaag aaatctgtat tgctgcttct aaagcttgtg 720 
288 gtatcacacc tgtgtatcat aatatgtttg ctttaatgag tgaaacagaa aggatctggt 780 
290 atccacccaa ccatgtcttc catatagatg agtcaaccag gcataatgta ctctacagaa 840 






RAW SEQUENCE LISTING DATE: 08/22/2006 

PATENT APPLICATION: US/10/580,458A TIME: 10:54:30

Input Set : A:\65691-445CorrSeqList.txt 
Output Set: N:\CRF4\08222006\J580458A.raw 

292 taagatttta ctttcctcgt tggtattgca gtggcagcaa cagagcctat cggcatggaa 900 

294 tatctcgagg tgctgaagct cctcttcttg atgactttgt catgtcttac ctctttgctc 960 

296 agtggcggca tgattttgtg cacggatgga taaaagtacc tgtgactcat gaaacacagg 1020 

298 aagaatgtct tgggatggca gtgttagata tgatgagaat agccaaagaa aacgatcaaa 1080 

300 ccccactggc catctataac tctatcagct acaagacatt cttaccaaaa tgtattcgag 1140 

302 caaagatcca agactatcat attttgacaa ggaagcgaat aaggtacaga tttcgcagat 1200 

304 ttattcagca attcagccaa tgcaaagcca ctgccagaaa cttgaaactt aagtatctta 1260 

306 taaatctgga aactctgcag tctgccttct acacagagaa atttgaagta aaagaacctg 1320 

308 gaagtggtcc ttcaggtgag gagatttttg caaccattat aataactgga aacggtggaa 1380 

310 ttcagtggtc aagagggaaa cataaagaaa gtgagacact gacagaacag gatttacagt 1440 

312 tatattgcga ttttcctaat attattgatg tcagtattaa gcaagcaaac caagagggtt 1500 

314 caaatgaaag ccgagttgta actatccata agcaagatgg taaaaatctg gaaattgaac 1560 

316 ttagctcatt aagggaagct ttgtctttcg tgtcattaat tgatggatat tatagattaa 1620 

318 ctgcagatgc acatcattac ctctgtaaag aagtagcacc tccagccgtg cttgaaaata 1680 

320 tacaaagcaa ctgtcatggc ccaatttcga tggattttgc cattagtaaa ctgaagaaag 1740

322 caggtaatca gactggactg tatgtacttc gatgcagtcc taaggacttt aataaatatt 1800 

324 ttttgacttt tgctgtcgag cgagaaaatg tcattgaata taaacactgt ttgattacaa 1860 

326 aaaatgagaa tgaagagtac aacctcagtg ggacaaagaa gaacttcagc agtcttaaag 1920 

328 atcttttgaa ttgttaccag atggaaactg ttcgctcaga caatataatt ttccagttta 1980 

330 ctaaatgctg tcccccaaag ccaaaagata aatcaaacct tctagtcttc agaacgaatg 2040 

332 gtgtttctga tgtaccaacc tcaccaacat tacagaggcc tactcatatg aaccaaatgg 2100 

334 tgtttcacaa aatcagaaat gaagatttga tatttaatga aagccttggc caaggcactt 2160 

336 ttacaaagat ttttaaaggc gtacgaagag aagtaggaga ctacggtcaa ctgcatgaaa 2220 

338 cagaagttct tttaaaagtt ctggataaag cacacagaaa ctattcagag tctttctttg 2280 

340 aagcagcaag tatgatgagc aagctttctc acaagcattt ggttttaaat tatggagtat 2340 

342 gtttctgtgg agacgagaat attctggttc aggagtttgt aaaatttgga tcactagata 2400 

344 catatctgaa aaagaataaa aattgtataa atatattatg gaaacttgaa gttgctaaac 2460 

346 agttggcatg ggccatgcat tttctagaag aaaacaccct tattcatggg aatgtatgtg 2520 

348 ccaaaaatat tctgcttatc agagaagaag acaggaagac aggaaatcct cctttcatca 2580 

350 aacttagtga tcctggcatt agtattacag ttttgccaaa ggacattctt caggagagaa 2640 

352 taccatgggt accacctgaa tgcattgaaa atcctaaaaa tttaaatttg gcaacagaca 2700 

354 aatggagttt tggtaccact ttgtgggaaa tctgcagtgg aggagataaa cctctaagtg 2760

356 ctctggattc tcaaagaaag ctacaatttt atgaagatag gcatcagctt cctgcaccaa 2820 

358 agtgggcaga attagcaaac cttataaata attgtatgga ttatgaacca gatttcaggc 2880 

360 cttctttcag agccatcata cgagatctta acagtttgtt tactccagat tatgaactat 2940 

362 taacagaaaa tgacatgtta ccaaatatga ggataggtgc cctagggttt tctggtgcct 3000 

364 ttgaagaccg ggatcctaca cagtttgaag agagacattt gaaatttcta cagcaacttg 3060 

366 gcaagggtaa ttttgggagt gtggagatgt gccggtatga ccctctacag gacaacactg 3120 

368 gggaggtggt cgctgtaaaa aagcttcagc atagtactga agagcaccta agagactttg 3180 

370 aaagggaaat tgaaatcctg aaatccctac agcatgacaa cattgtaaag tacaagggag 3240

372 tgtgctacag tgctggtcgg cgtaatctaa aattaattat ggaatattta ccatatggaa 3300 

374 gtttacgaga ctatcttcaa aaacataaag aacggataga tcacataaaa cttctgcagt 3360 

376 acacatctca gatatgcaag ggtatggagt atcttggtac aaaaaggtat atccacaggg 3420

378 atctggcaac gagaaatata ttggtggaga acgagaacag agttaaaatt ggagattttg 3480 

380 ggttaaccaa agtcttgcca caagacaaag aatactataa agtaaaagaa cctggtgaaa 3540 

382 gtcccatatt ctggtatgct ccagaatcac tgacagagag caagttttct gtggcctcag 3600 

384 atgtttggag ctttggagtg gttctgtatg aacttttcac atacattgag aagagtaaaa 3660 

386 gtccaccagc ggaatttatg cgtatgattg gcaatgacaa acaaggacag atgatcgtgt 3720 

388 tccatttgat agaacttttg aagaataatg gaagattacc aagaccagat ggatgcccag 3780
Please Note:

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.



Seq#:13; N Pos. 20,21
Seq#:14; N Pos. 20,21
VERIFICATION SUMMARY DATE: 08/22/2006

PATENT APPLICATION: US/10/580,458A TIME: 10:54:31

Input Set : A:\65691-445CorrSeqList.txt 
L:20 M:270 C: Current Application Number differs, Replaced Current Application Number
L:21 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:599 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 after pos.:0
L:622 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:14 after pos.:0